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INTERNATIONAL PRELIMINARY 

EXAMINATION REPORT International application No. PCT/GB98/02259 

I. Basis of the report 

1 . This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to 
the report since they do not contain amendments.): 

Description, pages: 

1-13 as originally filed 

Claims, No.: 

1-21 as originally filed 

Drawings, sheets: 

1/19-19/19 as originally filed 

2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

3. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

4. Additional observations, if necessary: 
II. Priority 

1 . □ This report has been established as if no priority had been claimed due to the failure to furnish within the 

prescribed time limit the requested: 

□ copy of the earlier application whose priority has been claimed. 

□ translation of the earlier application whose priority has been claimed. 

2. □ This report has been established as if no priority had been claimed due to the fact that the priority claim has 

been found invalid. 
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Thus tor the purposes of this report, the international tiling date indicated above is considered to be the relevant date. 
3. Additional observations, if necessary: 
see separate sheet 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious), 
or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 
ES claims Nos. 14,15, 16. 

because: 

□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 

□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specify): 

H the claims, or said claims Nos. 14 are so inadequately supported by the description that no meaningful 
opinion could be formed. 

G3 no international search report has been established for the said claims Nos. 15, 16. 
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V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 

1 . Statement 



Novelty (N) 


Yes 


Claims 


3. 12. 13, 17-20 




No: 


Claims 


1, 2. 4-11, 21 


Inventive step (IS) 


Yes. 


Claims 


none 




No: 


Claims 


1-13, 17-21 


Industrial applicability (IA) 


Yes: 


Claims 


1-13, 17, 18, 20. 21 




No: 


Claims 


19 (reserved opinion) 



2. Citations and explanations 
see separate sheet 

VI. Certain documents cited 

1. Certain published documents (Rule 70.10) 
and / or 

2. Non-written disclosures (Rule 70.9) 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 

see separate sheet 
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II. PRIORITY 

1 ) This first preliminary written opinion has been established considering the priority 
date 01 .08.97 as a valid date. The Applicant is reminded that documents: 

WO 97 37223 A (UNIV NORTH CAROLINA) 9 October 1997 
OHARA O. ET AL. in EMBL DATABASE, 5 December 1997 

cited in the international search report may become relevant after consideration of 
the priority document which is unavailable at present. 

III. NON-ESTABLISHMENT OF OPINION 

2) The subject-matter of Claim 14 is not supported by the description and thus, it is 
not amenable to examination of novelty, inventive step or industrial applicability. 
Said claim relates to a method for identifying a compound capable of modulating 
the activity of a ZGGBP1 protein. However, a function of said protein has not been 
demonstrated. Sequence analysis of the encoding gene has revealed several 
structural domains which correspond to several potential functions (p. 2). 
Furthermore, sequence homology of the encoding gene with at least two different 
genes, nedd-4 and Pub3, indicates even more potential functions (p. 3). The 
application as filed, does not provide any indication as to which of all the potential 
activities of ZGGBP1 the method of Claim 14 is directed to. Since the method of 
said claim is not supported by the description, an opinion cannot be established . 

3) No opinion is established for the subject-matter of Claims 15 and 16 because 
said claims relate to inventions in respect of which no international search report 
has been established (Rule 66.1(e) PCT). 

V. REASONED STATEMENT UNDER ARTICLE 35(2) 

4) The present application relates to the isolation of a cDNA clone comprising the 
sequence presented in SEQ ID NOT, called gene ZGGBP1. Said sequence 
resides, along with many other expressed sequences, in the 18q21 chromosomal 
region which was shown previously to be associated with the neurological 
disorder bipolar affective disorder. The ZGGBP1 gene sequence appears to have 
85% identity at the amino acid level with the human gene encoding nedd-4 which 
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was shown previously to be a negative regulator of a sodium channel which is 
deleted in Liddle's syndrome (a hereditary form of hypertension). The ZGGBP1 
gene sequence shares also 100% identity, over a 3kb region, with the previously 
identified gene, Pub3 which may be involved in regulation of cell cycle of 
eukaryotic cells. The Applicant has not demonstrated that said ZGGBP1 gene has 
any function. 

5) The subject-matter of Claims 1, 2, 4-11, 21 is not novel as required by Article 
33(2) PCT. 

Claim 1 relates to a polynucleotide comprising a nucleic acid sequence encoding 
the polypeptide of SEQ ID NO 2, and homologues and fragments. 

Figure 2 of the present application discloses the amino acid sequence of the 
known human nedd-4 gene which is 85% homologous to polypeptide of SEQ ID 
NO 2 (p.1 1). Figure 5 of the present application discloses the nucleotide 
sequence of the known gene Pub-3 which comprises fragments up to 3 kb long 
which are identical to the polynucleotide encoding SEQ ID NO 2. 

Thus, the subject-matter of said claim has been disclosed in the prior art. 

Similar arguments apply for the subject-matter of Claims 2, 4-11, 21. 

6) The subject-matter of Claims 3, 12, 13, 17-20 is not inventive as required by 
Article 33(3) PCT. 

Said claims refer to subject-matter as defined in Claims 1-7, 10 or 11 and 

comprise additional technical features which may render said subject-matter 
novel. However, said technical features represent standard options available to 
the skilled person aware of the state of the art in the field of gene technology. 
Thus, said technical features do not impart any inventiveness to said subject- 
matter. Consequently, even if it is assumed that the subject-matter of Claims 3, 
12, 13, 17-20 is novel, said claims do not comprise an inventive step. 

7) Claim 19 is directed to the use of a polynucleotide in gene therapy. Said claim is 
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thus, considered to be directed to method of treatment of the human or animal 
body. For the assessment of claims as far as they are directed to a method of 
treatment of the human or animal body or to a diagnostic method practised on the 
human or animal body, no unified criteria exist in the PCT, on the question 
whether they are industrially applicable. The patentability can be dependent upon 
the formulation of the claims. 

VI. CERTAIN DOCUMENTS CITED 

8) The following documents are cited under Rule 70.10 PCT 

WO 97 37223 A, 9.10.97, filed 03.04.97, with priority date of 03.04.96 

VIII. CERTAIN OBSERVATIONS ON THE INTERNATIONAL APPLICATION 

9) The Applicant is reminded that the claims must be comprehensible from the 
technical point of view and clearly define the object of the invention, that is to say 
indicate all the essential features thereof (Rule 6 PCT). The subject-matter of 
Claims 3, 7, 17 and 21 does not fulfil this condition, as the claimed nucleic acid is 
only defined by the name of the encoded protein "ZGGBPT without disclosing 
any technical feature which unambiguously characterizes the claimed subject- 
matter. A gene being a chemical product should be clearly defined by its formula 
i.e. its nucleotide sequence. 

10) Claim 1 is drafted towards an isolated nucleic acid with specified sequence 
content. However, the function of said nucleic acid is not clearly stated in the 
claim. 

The Applicant speculates in the description that said nucleic acid encodes a 
protein which may be associated with the bipolar affective disorder although no 
evidence for such association is presented. Furthermore, even if such an 
association of the claimed protein with the bipolar affective disorder is established, 
a function of the protein is not defined nor could be speculated, especially since 
sequence analysis of the protein appears to indicate several potential functions. 
Since there is no conclusive demonstration of any function of the polypeptide of 
SEQ ID NO 2. as such, any function can, at best, be accepted as speculative. At 
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this point, it is not apparent what problem said polypeptide or fragments thereof 
are intended to solve or whether said sequences represent a solution comprising 
an inventive step. Insufficient disclosure of essential technical features, such as 
the function of a given nucleotide or amino acid sequence, does not allow for the 
acknowledgement of an inventive step involved in solving a technical problem. 
The lack of sufficient disclosure of the invention is contrary to Article 5 PCT and 
makes it difficult if not impossible to examine the novelty and inventive step of the 
claimed invention. 

The same arguments apply to the subject-matter of Claims 2-7 and 10-12. 

1 1) The subject-matter of Claim 11 is not clear as required by Rule 6 PCT. Said claim 
relates to a polypeptide comprising the amino acid sequence of SEQ ID NO 4. As 
said sequence is being disclosed only in the sequence listing without any mention 
of it in the description, it is not entirely clear what relation it bears with the 
invention as disclosed in the present application. 
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PCT/GB 
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A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/00 C07K14/435 C12N9/10 



C12Q1/68 



According To international Patent Oasstf ication dPC) or to bom national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searcned (ciassrfication system followed by c:assrfication symbols! 

IPC 6 C07K C12N 



Documentation searcned other tnan minimum documentation to me extent mat sucn documents are ir.ciuaec in tne tieias searched 



Electronic data Dase consulted during tne international searcn (name ot data base ana, wnere practical, searcn terms usea> 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category Citation ot document, with indication wnere appropriate ot the relevant passages 



Relevant to claim No 



A 



X,P 



WO 97 37223 A (UNIV NORTH CAROLINA) 
9 October 1997 

see abstract 

see page 9. line 1 - page 10. line 23 
see figure 23 
see claim 48 

see Nos.125 and 126 of Sequence Listing 

OHARA 0. ET AL . : "Prediction of the 
sequences of unidentified human genes. 
VIII. The complete senquences of 77 new 
cDNA clones from brain which can code for 
large proteins in vitro" 

EMBL DATABASE. 5 December 1997, XP002087609 
HEIDELBERG. DE 
AC: AB007899 



6,10, 
12-14. 
18-21 
1,2,4 



1,2,4. 
8-10, 18, 
21 



Further documents are listed in the continuation ot box C 



Patent (amity members are listed in annex 



J Special categories ot cited documents 

"A" document defining tne general state of the art which is not 
considered to be ot particular relevance 

"E " earlier document but published on or after the international 
filing date 

T" document which may throw douets on priority ciaimts) or 
which rs cited to establish the publication date ot another 
citation or other special reason (as specrtied) 

"O" document referring to an oral disclosure, use. exhibition or 
other means 

"P" document published prior to the international filing date but 
tater than the priority date claimed 



T" later document published after the international tiling date 
or pnonty date and not m conttict wrtn the application but 
cited to understand the principle or tneory underlying the 
invention 

"X" document of particular relevance: the claimed invention 
cannot be considered novel or cannot be considered to 
mvotve an inventive step when the document is taken aione 

"Y" document ot particular relevance the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments such combination being obvious to a person skilled 
in the art 

document member of the same patent family 



Date of the actual completion or tne international searcn 
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Date ot mailing of the international search report 

12/01/1999 



Name and mailing address of the (SA 

European Patent Office. P 9 5818 Patentlaan 2 
NL - 2280 HV Ri|swi|k 
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Authorized officer 
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relevant to ctaim No 
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STINE O.C. ET AL ■ "Evidence for linkano 

of bipolar disorder to chromosome 18 with 

a parent-of-origin effect" 

AMERICAN JOURNAL OF HUMAN GENETICS, 

vol. 57, no. 6, 1995, pages 1384-1394 

XP002087610 

US 

cited in the application 
see the whole document 




A 


MORS 0. ET AL.: "Cytogenetic 
abnormalities on chromosome 18 associated 
with bipolar affective disorder or 
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BRITISH JOURNAL OF PSYCHIATRY, 

vol. 170, March 1997, pages 278-280 

XP002087611 

GB 
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Box t Observations where certain claims were found un searchable (Continuation of item 1 of first sheet) 

Th,s international Search Report has not Deen established m respect of certa.n claims under Article I7( 2 )(a) tor me following reasons. 



1 ! | Claims Nos.: 

because tney relate 10 suD,ect matter not required to be searched by this Autnonty. namely 



2. I X I Claims Nos 



CLAIMS 15. 16 

because thev relate to parts of me international Application that do not comply with the prescr.beo retirements to*um 
an extent mat no meaningful International Searcn can be earned out. specific^ P rescrit3ea requirements tosucn 

SEE FURTHER INFORMATION SHEET PCT/ISA/210 



3 ' \ Claims Nos 

because tnev are dependent claims and are not drafted n accordance with the second and th.rd sentences of Ruie6 4(ai 

Box II Observations where unity of in vention is lacking (Continuation of item 2 of first sheet) 

Tn,s international Searcn.ng Authority found multiple mvennons .n tn.s international application, as follows 



1 I I L^rn^^ ^ da,tl ° nal search tees were Un] ^ ™ aopneam tms international Search Report c 



' 3rVnVa e aa»Iona^ ^ ^ SearChed '^mymg an additional fee tn,s Authority did not .nvtec 



3 LJ rLV^n^l 0 * ^ reau ' rea addlt,onai searcn fees were timely P a>d bv the applicant. m,s international Searcn Report 
covers only those claims for which tees were paid, specifically cla.ms Nos. searcn Heport 



4 — ' resm^ro f h ? n°2nl teeS were nme,v Da ' d &V the a °D 1,ca ™ Conseauently. th* international Search Report , 

restricted to the invention first mentioned m the claims: ,t is covered oy ciaims Nos . P 



Remark on Protest 



l~ ' Tne additional search fees were accompanied by the applicants protest 
i i ,No protest accompanied the payment of additional search tees 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 

CLAIMS NOS. : 15, 16 ™ ~ 

A search for the claims 15 and 16, respectively relating to a compound 
modulating the activity of the protein of claims 1 and 2 and to a substance 
modulating the compound identified by the method of claim 14, could not be 
performed, since the subject-matter is not sufficiently disclosed 
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Sheet No. ..4.. 



Filing date 
of earlier application 
(day/month/year) 



item (I) 

01 August 1997 



item (2) 



(01.08.97^ 



Number 
of earlier application 



9716162.4 



| 1 Further priority claims an 

Where earlier application is: 




ited in the Supplemental Box. 



national application 
country 



GB 



regional application:* 
regional Office 



international application: 
receiving Offi ce^ 



item (3) 



[X"| Th c receiving Office is requested to prepare and transmit to the International Bureau a certified copy 
of the earlier application(s) (only if the earlier application was filed with the Office which for the 
purposes of the present international application is the receiving Office) identified above as item(s): item 



+ Where the earlier application is an ARIPO application, it is mandatory to indicate in the Supplemental Box at least one country party to the Paris 
Convention for the Protection of Industrial Property for which that earlier application was filed (Rule 4J0(b)(ii)). See Supplemental Box, 



Box No. VII INTERNATIONAL SEARCHING AUTHORITY 



Choiccof International Searching Authority (ISA) 
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Request to use results of earlier search; reference to that search (if an earlier 
search has been carried out by or requested from the International Searching Authority): 
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3. □ copy of general power of attorney; reference number, if any: 
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6. □ translation of international application into (language): 
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Figure of the drawings which 
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Neil Godfrey Alasdair PHILLIPS 
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1 . Date of actual receipt of the purported 
international application: 


2. Drawings: 


3. Corrected date of actual receipt due to later but 
timely received papers or drawings completing 
the purported international application: 


| | received: 


4. Date of timely receipt of the required 
corrections under PCT Article 1 1(2): 


| 1 not received: 


5. International Searching Authority TC , , 
(if two or more are competent): l^A / 


6. 1 1 Transmittal of search copy delayed 
1 1 until search fee is paid 




Date of receipt of the record copy 1 
by the International Bureau: 1 
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INTERNATIONAL SEARCH REPORT 

(PCT Article 18 and Rules 43 and 44) 



Applicant's or agent's file reference 

PHM 70251/WO 


FOR FURTHER see Notification of Transmittal of International Search Report 

(Form PCT/ISA/220) as well as. where applicable, item 5 below. 

ACTION 


International application No. 

PCT/GB 98/02259 


International filing date (day/month/year) 

28/07/1998 


(Earliest) Priority Date (day/month/year) 

01/08/1997 


Applicant 

ZENECA LIMITED et al . 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. 

This International Search Report consists of atotal of 5 sheets. 

| | It is also accompanied by a copy of each priorart document cited in this report. 



1 [^] Certain claims were found unsearchable (see Box I) 

2 \^\ Unity of invention Is lacklngisee Box II). 



The international application contains disclosure of a nucleotide and/or amino acid sequence listing and the 

international search was carried out on the basis of the sequence listing 

|~X~] filed with the international application, 

| | furnished by the applicant separately from the international application, 

| | but not accompanied by a statement to the effect that it did not include 

matter going beyond the disclosure in the international application as filed. 

| | Transcribed by this Authority 



4. With regard to the title, 



| | the text is approved as submitted by the applicant 

| X I the text has been established by this Authorityto read as follows 



ZGGBPl, NOVEL PEPTIDES RELATED TO BIPOLAR AFFECTIVE DISORDER TYPE 1, SEQUENCES 
AND USES THEREOF 



5. With regard to the abstract, 



f)("| the text is approved as submitted by the applicant 

| | the text has been established, according to Rule 38.2(b). by this Authority as it appears in 
Box III. The applicant may. within one month fromthe date of mailing of this International 
Search Report, submit comments to this Authority. 



6. The figure of the drawings to be published with the abstract is: 

Figure No as suggested by the applicant. 

| | because the applicant failed to suggest a figure. 

| | because this figure better characterizes the invention 



|~X~| None of the figures 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) forthe following reasons: 
1 . | | Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. \Y\ Claims Nos.: CLAIMS 15, 16 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out. specifically: 

SEE FURTHER INFORMATION SHEET PCT/ISA/210 

3. | | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking(Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1 . As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' searchable claims. 

2. | | As all searchable claims could be searched without effort justifying an additional fee. this Authority did not invitepayment 

of any additional fee. 

3. As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' ' covers only those claims for which fees were paid. specifically claims Nos.: 



4. | ] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims: it is covered by claims Nos.: 



Remark on Protest | j The additional search fees were accompanied by the applicant's protest 

| [ No protest accompanied the payment of additional search fees. 
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CLAIMS NOS. : 15, 16 

A search for the claims 15 and 16, respectively relating to a compound 
modulating the activity of the protein of claims 1 and 2 and to a substance 
modulating the compound identified by the method of claim 14, could not be 
performed, since the subject-matter is not sufficiently disclosed. 
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A search for the claims 15 and 16, respectively relating to a compound 
modulating the activity of the protein of claims 1 and 2 and to a 
substance modulating the the compound identified by the method of claim 
14, could not be performed, since the subject-matter is not sufficiently 
di scl osed . 
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IPC 6 C12N15/00 C07K14/435 C12N9/10 C12Q1/68 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07K C12N 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category c 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No 


X,P 


WO 97 37223 A (UNIV NORTH CAROLINA) 


6,10, 




9 October 1997 


12-14, 






18-21 
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see abstract 


1,2,4 




see page 9, line 1 - page 10, line 23 






see figure 23 






see claim 48 






see Nos.125 and 126 of Sequence Listing 
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OHARA 0. ET AL.: "Prediction of the 
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large proteins in vitro" 
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AC: AB007899 
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Patent family members are listed in annex. 



0 Special categories of cited documents : 

"A 1 document defining the general state of the art which is not 
considered to be of particular relevance 

"E' earlier document but published on or after the international 
filing date 

"L" document which may throw doubts on pnonty claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document refernng to an oral disclosure, use. exhibition or 
other means 

"P document published prior to the international filing date but 
later than the priority date claimed 



later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 

11 December 1998 


Date of mailing of the international search report 

12/01/1999 


Name and mailing address of the ISA 

European Patent Office, P.B 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 

Te: ( *31 -70) 340-2040. Tx 3l65leponl. 
Fax (-31-701 340-3016 


Authorized officer 

Panzica, G 
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Category '• Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No 



STINE O.C. ET AL.: "Evidence for linkage 

of bipolar disorder to chromosome 18 with 

a parent-of-origin effect" 

AMERICAN JOURNAL OF HUMAN GENETICS, 

vol. 57, no. 6, 1995, pages 1384-1394, 

XP002087610 

US 

cited in the application 
see the whole document 

MORS 0. ET AL . : "Cytogenetic 
abnormalities on chromosome 18 associated 
with bipolar affective disorder or 
schi zophreni a" 

BRITISH JOURNAL OF PSYCHIATRY, 

vol. 170, March 1997, pages 278-280, 

XP002087611 
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(PCT Article 36 and Rule 70) 



Applicant's or agent's file reference 
PHM 70251/WO 


See Notification of Transmittal of International 
FOR FURTHER ACTION Preliminary Examination Report (Form PCT/IPEA/41 6) 


International application No 
PCT/GB98/02259 


International filing date (day/month/year) 
28/07/1998 


Priority date (day/month/year) 
01/08/1997 


International Patent Classification (IPC) or nat 
C12N 15/00 


lonal classification and IPC 


Applicant 

ZENECA LIMITED et al. 





1 . This international preliminary examination report has been prepared by this International Preliminary Examining Authority 



and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 8 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 




3. This report contains indications relating to the following items: 

I Basis of the report 

II S Priority 

III Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

IV □ Lack of unity of invention 

V £3 Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 

citations and explanations suporting such statement 

VI ^ Certain documents cited 

VII □ Certain defects in the international application 

VIII S Certain observations on the international application 
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Date of completion of this report 






08/02/1999 
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Name and mailing address of the international 
preliminary examining authority 


Authorized officer 






- European Patent Office 
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Fax +49 89 2399 - 4465 
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Telephone No +49 89 2399 870 
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INTERNATIONAL PRELIMINARY 

EXAMINATION REPORT international application No. PCT/GB98/02259 



I. Basis of the report 

1 . This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Article 14 are referred to in this report as '-originally fiied" and are not annexed to 
the report since they do not contain amendments.): 

Description, pages: 

1-13 as originally filed 

Claims, No.: 

1 -21 as originally filed 

Drawings, sheets: 

1/1 9-1 9/1 9 as originally filed 

2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

3. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

4. Additional observations, if necessary: 
tl. Priority 

1 . □ This report has been established as if no priority had been claimed due to the failure to furnish within the 

prescribed time limit the requested: 

□ copy of the earlier application whose priority has been claimed. 

□ translation of the earlier application whose priority has been claimed 

2. □ This report has been established as if no priority had been claimed due to the fact that the priority claim has 

been found invalid. 
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INTERNATIONAL PRELIMINARY 

EXAMINATION REPORT International application No. PCT/GB98/02259 

Thus for the purposes of this report, the international filing date indicated above is considered to be the relevant date. 
3. Additional observations, if necessary: 
see separate sheet 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious), 
or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 
E3 claims Nos. 14,15, 16. 

because: 

□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 

□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specify): 

K the claims, or said claims Nos. 14 are so inadequately supported by the description that no meaningful 
opinion could be formed. 

K no international search report has been established for the said claims Nos. 15, 16. 
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V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 

1 . Statement 



Novelty (N) 


Yes: 


Claims 


3, 12, 13, 17-20 




No: 


Claims 


1, 2, 4-11, 21 


Inventive step (IS) 


Yes: 


Claims 


none 




No: 


Claims 


1-13 T 17-21 


Industrial applicability (IA) 


Yes: 


Claims 


1-13, 17, 18, 20, 21 




No: 


Claims 


19 (reserved opinion) 



2. Citations and explanations 
see separate sheet 

VI. Certain documents cited 

1 . Certain published documents (Rule 70.10) 
and / or 

2. Non-written disclosures (Rule 70.9) 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 

see separate sheet 
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II. PRIORITY 

1 ) This first preliminary written opinion has been established considering the priority 
date 01.08.97 as a valid date. The Applicant is reminded that documents: 

WO 97 37223 A (UNIV NORTH CAROLINA) 9 October 1997 
OHARA O. ET AL. in EMBL DATABASE, 5 December 1997 

cited in the international search report may become relevant after consideration of 
the priority document which is unavailable at present. 

III. NON-ESTABLISHMENT OF OPINION 

2) The subject-matter of Claim 14 is not supported by the description and thus, it is 
not amenable to examination of novelty, inventive step or industrial applicability. 
Said claim relates to a method for identifying a compound capable of modulating 
the activity of a ZGGBP1 protein. However, a function of said protein has not been 
demonstrated. Sequence analysis of the encoding gene has revealed several 
structural domains which correspond to several potential functions (p. 2). 
Furthermore, sequence homology of the encoding gene with at least two different 
genes, nedd-4 and Pub3, indicates even more potential functions (p. 3). The 
application as filed, does not provide any indication as to which of all the potential 
activities of ZGGBP1 the method of Claim 14 is directed to. Since the method of 
said claim is not supported by the description, an opinion cannot be established . 

3) No opinion is established for the subject-matter of Claims 15 and 16 because 
said claims relate to inventions in respect of which no international search report 
has been established (Rule 66.1(e) PCT). 

V. REASONED STATEMENT UNDER ARTICLE 35(2) 

4) The present application relates to the isolation of a cDNA clone comprising the 
sequence presented in SEQ ID N01, called gene ZGGBP1 . Said sequence 
resides, along with many other expressed sequences, in the 18q21 chromosomal 
region which was shown previously to be associated with the neurological 
disorder bipolar affective disorder. The ZGGBP1 gene sequence appears to have 
85% identity at the amino acid level with the human gene encoding nedd-4 which 
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was shown previously to be a negative regulator of a sodium channel which is 
deleted in Liddle's syndrome (a hereditary form of hypertension). The ZGGBP1 
gene sequence shares also 100% identity, over a 3kb region, with the previously 
identified gene, Pub3 which may be involved in regulation of cell cycle of 
eukaryotic cells. The Applicant has not demonstrated that said ZGGBP1 gene has 
any function. 

5) The subject-matter of Claims 1, 2, 4-11, 21 is not novel as required by Article 
33(2) PCT. 

Claim 1 relates to a polynucleotide comprising a nucleic acid sequence encoding 
the polypeptide of SEQ ID NO 2, and homologues and fragments. 

Figure 2 of the present application discloses the amino acid sequence of the 
known human nedd-4 gene which is 85% homologous to polypeptide of SEQ ID 
NO 2 (p. 11). Figure 5 of the present application discloses the nucleotide 
sequence of the known gene Pub-3 which comprises fragments up to 3 kb long 
which are identical to the polynucleotide encoding SEQ ID NO 2. 

Thus, the subject-matter of said claim has been disclosed in the prior art. 

Similar arguments apply for the subject-matter of Claims 2, 4-1 1,21. 

6) The subject-matter of Claims 3, 12, 13, 17-20 is not inventive as required by 
Article 33(3) PCT. 

Said claims refer to subject-matter as defined in Claims 1-7, 1 0 or 11 and 

comprise additional technical features which may render said subject-matter 
novel. However, said technical features represent standard options available to 
the skilled person aware of the state of the art in the field of gene technology. 
Thus, said technical features do not impart any inventiveness to said subject- 
matter. Consequently, even if it is assumed that the subject-matter of Claims 3, 
12, 13, 17-20 is novel, said claims do not comprise an inventive step. 

7) Claim 19 is directed to the use of a polynucleotide in gene therapy. Said claim is 
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thus, considered to be directed to method of treatment of the human or animal 
body. For the assessment of claims as far as they are directed to a method of 
treatment of the human or animal body or to a diagnostic method practised on the 
human or animal body, no unified criteria exist in the PCT, on the question 
whether they are industrially applicable. The patentability can be dependent upon 
the formulation of the claims. 

VI. CERTAIN DOCUMENTS CITED 

8) The following documents are cited under Rule 70.10 PCT 

WO 97 37223 A, 9.10.97, filed 03.04.97, with priority date of 03.04.96 

VIII. CERTAIN OBSERVATIONS ON THE INTERNATIONAL APPLICATION 

9) The Applicant is reminded that the claims must be comprehensible from the 
technical point of view and clearly define the object of the invention, that is to say 
indicate all the essential features thereof (Rule 6 PCT). The subject-matter of 
Claims 3, 7, 17 and 21 does not fulfil this condition, as the claimed nucleic acid is 
only defined by the name of the encoded protein "ZGGBP1" without disclosing 
any technical feature which unambiguously characterizes the claimed subject- 
matter. A gene being a chemical product should be clearly defined by its formula 
i.e. its nucleotide sequence. 

10) Claim 1 is drafted towards an isolated nucleic acid with specified sequence 
content. However, the function of said nucleic acid is not clearly stated in the 
claim. 

The Applicant speculates in the description that said nucleic acid encodes a 
protein which may be associated with the bipolar affective disorder although no 
evidence for such association is presented. Furthermore, even if such an 
association of the claimed protein with the bipolar affective disorder is established, 
a function of the protein is not defined nor could be speculated, especially since 
sequence analysis of the protein appears to indicate several potential functions. 
Since there is no conclusive demonstration of any function of the polypeptide of 
SEQ ID NO 2, as such, any function can, at best, be accepted as speculative. At 
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this point, it is not apparent what problem said polypeptide or fragments thereof 
are intended to solve or whether said sequences represent a solution comprising 
an inventive step. Insufficient disclosure of essential technical features, such as 
the function of a given nucleotide or amino acid sequence, does not allow for the 
acknowledgement of an inventive step involved in solving a technical problem. 
The lack of sufficient disclosure of the invention is contrary to Article 5 PCT and 
makes it difficult if not impossible to examine the novelty and inventive step of the 
claimed invention. 

The same arguments apply to the subject-matter of Claims 2-7 and 10-12. 

1 1 ) The subject-matter of Claim 1 1 is not clear as required by Rule 6 PCT. Said claim 
relates to a polypeptide comprising the amino acid sequence of SEQ ID NO 4. As 
said sequence is being disclosed only in the sequence listing without any mention 
of it in the description, it is not entirely clear what relation it bears with the 
invention as disclosed in the present application. 
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ZGGBP1, NOVEL PEPTIDES RELATED TO BIPOLAR AFFECTIVE DISORDER TYPE 1, SEQUENCES AND USES 
THEREOF 

This invention relates to a novel human gene (ZGGBP1 ) associated with affective 
neurological disorders such as bipolar affective disorder. The invention also relates to 
5 homologues of the ZGGBP1 gene in species such as rat and mouse useful in providing 
animal models of affective disorders. The invention further relates to both the cDNA and 
the structural gene and to fragments encoding functional domains within the gene. The 
invention also relates to means for producing the protein encoded by the gene and to 
means for regulating its production and activity in vivo. 

10 Affective disorders comprise a broad and heterogeneous category of psychiatric 

illness with a prevalence of up to 20% in the population. The most severe of these 
disorders is bipolar type I which affects approximately 1% of the population and this rate 
is fairly consistent across countries. The disease affects young adults, with a mean age of 
onset of 22 years. Treatment depends upon the phase of the disease and pharmacological 

15 agents include lithium carbonate, carbamazepine or valproic acid, tricyclic antidepressants. 
Monoamine oxidase inhibitors and selective serotonin re-uptake inhibitors are now also 
being used. The success rate of individual drugs is variable and some patients are treated 
with a combination of agents, although most have some unwanted side-effects. At present 
the precise diagnosis of individual affective disorders is difficult and new, gene based, 

20 diagnostic methods are desirable. 

Family, twin and adoption studies have suggested the importance of genetic 
predisposition to bipolar affective disorder. On this basis, several groups have undertaken 
genetic linkage analysis in families with a high incidence of the disorder to find a causal 
gene. Many of the studies show conflicting data suggesting that a single gene is unlikely 

25 to be the cause. Rather, multiple interacting genetic traits may be involved. A recent 

study (Stine et al. 1995) identified two regions on chromosome 18 showing linkage to the 
disease. 

The present invention is based on our discovery of a novel gene which 
maps to 18q21 and which unexpectedly shows appreciable sequence homology to the ned- 
30 4 gene on chromosome 15. Ned-4 is the human homologue of the mouse nedd-4 gene 
which is known to be differentially expressed during neural development and to be 
involved in signal transduction. Human ned-4 has been shown (Schild et al. 1996. Straub 
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et al. 1996) to be a negative regulator of a sodium channel which is deleted in Liddle's 
syndrome (a hereditary form of hypertension). 

Nedd-4 was originally isolated as a partial cDNA clone from a mouse brain 
library (Kumar et al. 1992) as one of a set of genes which were differentially expressed 
during development (Neural precursor cells expressed developmentally down-regulated). 
The derived amino acid sequence contains three copies of the WW domain (Andre & 
Spnngael 1994, Bork & Sudol, 1994; Hofmann & Boucher, 1995), a Ca lipid binding 
(CaLB/C2) domain (Brose et al. 1995) and a Hect (homologous to the E6-AP carbodyl 
terminus) domain which has homology to a ubiquitin ligase (E3) enzyme (Huibregtse et al. 
1995). The human homologue of nedd-4 (Ned-4) was isolated as an randomly cloned EST 
(K1AA0093) from immature myeloblast mRNA (Nomura et al. 1994) and shown by 
sequence comparison to have 86% identity at the amino acid level to the mouse sequence. 
The human sequence, however, has a fourth copy of the WW domain. 

The WW domain is a 40 amino acid sequence found in several unrelated proteins. 
The two highly conserved tryptophans give it its name. The function of the domain is 
thought to be involved in protein-protein interactions. Despite their functional diversity, the 
proteins listed all appear to be involved in cell signalling or regulation. It has been shown 
that the WW domains of Nedd-4 interact with the proline-nch PY motifs in the epithelial 
sodium channel in the kidney (Schild et al. 1996). Mutational deletion of the PY motifs in 
the epithelium sodium channel in Liddle's syndrome, an inherited disease causing systemic 
hypertension characterised by hyperactivity of the sodium channel, has been shown to 
abrogate binding of Nedd-4 (Straub et al. 1996). It is therefore likely that Nedd-4 has a 
negative regulatory role when bound to the channel. 

The Hect domain is an E3 ubiquitin-protein ligase domain and enzymes with this 
domain catalyse polyubiquitination. which is involved in several cellular processes 
including proteolytic degradation. 

The CaLB/C2 domain is thought to be involved in calcium-dependent phospholipid 
binding, although some proteins containing this domain do not bind calcium and other 
putative functions for the C2 domain such as binding to inositol - 1 ,3,4,5-tetraphosphate 
have been suggested. Examples of proteins containing this domain are Protein Kinase C 
(PKC) isoenzymes and synaptogamins. 
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PCT patent application W097/ 12962 discloses a protein (Pub3) with homology to 
Publ, a Schizosaccaromyces Pombe protein which has an apparent function in the 
ubiquitination of, among other cellular proteins, the mitotic activating tyrosine 
phosphatase cdc25 and the tumour suppresser protein p53. As such this protein may be 
5 involved in regulating the progression of proliferation in eukaryotic cells by effectively 
controlling the activity of the cdk complexes by modulating the availability of cdc25 
and/or p53. 

A comparison of Pub3 with ZGGBP1 revealed that the sequences represent two 
distinct genes which code for two separate, structurally unrelated proteins. The two genes 

10 share sequence homology within a certain defined region, the sequences are identical 

within the region 516-3568 of ZGGBP1, but they do not show any homology within the 
regions 5' and 3'of this sequence. In addition the derived amino acid sequence for 
ZGGBP1 is completely different to that derived for Pub 3 as both have been initiated from 
a different start methionine. A comparison of the nucleotide sequences for ZGGBP1 and 

15 Pub 3 is outlined in Figure 5. 

Therefore in a first aspect of the present invention we provide the ZGGBP1 gene 
having the full length cDNA as set out in SEQ ID NO: 1 . We further provide fragments of 
the ZGGBP1 gene comprising ZGGBP1 sequence outside the region defined by base pairs 
516-3568 of the ZGGBP1 gene. By fragments we mean contiguous regions of the gene 

20 including complementary DNA and RNA sequences, starting with short sequences useful 
as probes or primers of say about 8-50 bases, such as 10-30 bases or 15-35 bases, to longer 
sequences of up to 50, 100, 200, 500 or 1000 bases. Indeed any convenient fragment of 
the gene of say up to 2kb, 3kb, 4kb or more than 4kb may be a useful gene fragment for 
further research, therapeutic or diagnostic purposes. Further convenient fragments include 

25 those whose terminii are defined by restriction sites within the gene of one or more kinds, 
such as any combination of Rsa 1 , Alu 1 and Hinf 1 . 

In a further aspect of the invention we provide homologues of the ZGGBP1 gene in 
species such as rat and mouse useful in providing animal models of affective disorders. 
By homologue, we mean a corresponding ZGGBP1 gene in another species, which 

30 displays greater than 85% sequence homology, conveniently greater than 909r, for 

example 95%. to the human ZGGBP1 sequence. The full sequences of the individual 
homologues may be determined using conventional techniques such as hybridisation, PGR 
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and sequencing techniques, starting with any convenient part of the sequence set out in 
SEQ ID NO: 1 . The partial sequence of the mouse gene is set out in SEQ ID NO: 3 and 
this gene and the protein encoded by this gene represent further independent aspects of the 
invention. 

5 In a further aspect of the invention we provide polynucleotide sequences capable of 

specifically hybridising to the ZGGBP1 gene. By specifically hybridising we mean that 
the polynucleotide hybridises under stringent conditions to the sequence on chromosome 
1 8q2 1 as set out in SEQ ID No: 1, or to the corresponding non-coding sequence, to the 
exclusion of other genomic loci. It is contemplated that a species such as a peptide nucleic 

10 acid may be an acceptable equivalent to a polynucleotide, at least for purposes that do not 
require translation into protein. 

In a further aspect of the invention we provide a recombinant ZGGBP1 protein 
obtained by expression of all or a part of the cDNA as set out in SEQ ID NO: 1 . The 
recombinant protein may comprise all or a convenient part of the peptide sequence set out 

15 in SEQ ID NO: 2. The production of a protein according to the invention may be achieved 
using standard recombinant DNA techniques involving the expression of the protein by a 
host cell as described for example by Sambrook et al. 1989. The isolated nucleic acids 
described herein may for example be introduced into any convenient expression vector for 
example the T7 Studier system for expression in E.coli (US-A-4952496), Pichia pastoris 

20 for expression in yeast, the Baculovirus system for expression in insect cells and the GS 
system for expression in mammalian cells by operatively linking the DNA to any- 
necessary expression control elements therein and transforming any suitable prokaryotic 
or eukaryotic host cell with the vector using well known procedures. 

Therefore in a further aspect of the invention we provide a recombinant plasmid 

25 comprising all or a part of the ZGGBP1 cDNA of the invention. 

The invention further extends to cells containing said recombinant plasmids and to 
a process for producing a ZGGBP1 protein of the invention which comprises culturing 
said cells such that the desired protein is expressed and recovering the protein from the 
culture. 

30 By way of example, the nucleotide sequence in SEQ ID NO: 1 is inserted 

downstream of the SV40 promoter in the pGEX plasmid vector, and either transiently or 
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stably expressed in COS -7 cells. Expression of the protein according to the invention can 
be delected following disruption of the cells by Western blotting . 

It may be desirable to produce the individual functional domains of the protein 
according to the invention in isolation from the rest of the molecule. This may be 
5 achieved using the above standard recombination DNA techniques except that in this 

instance the DNA sequence used is that encoding one of the partial amino acid sequences 
of the domains identified in Figure 1 or a combination of these. 

By way of further example, the nucleotide sequence in SEQ ID NO: 1 is inserted 
downstream of the SV40 promoter and the glutathionc-S-transferase (GST) coding 
10 sequence in the pBC plasmid vector, and either transiently or stably expressed in COS -7 
cells allowing expression of the corresponding fusion protein. Expression of the fusion 
protein can be detected following disruption of the cells by Western blotting with 
antibodies to GST, and furthermore the fusion protein can be used in an affinity binding 
procedure to find proteins which are functional partners of the protein of the invention 
15 from cell extracts. 

A ZGGBP1 protein of the invention may in particular be used to screen for 
compounds which regulate the activity of the enzymes and the invention extends to such a 
screen and to the use of compounds obtainable therefrom to regulate the activity of the 
protein in vivo. 

20 Thus according to a further aspect of the invention we provide a method for 

identifying a compound capable of modulating the action of a ZGGBP1 protein which 
method comprises subjecting one or more test compounds to a screen comprising (A) a 
protein containing the amino acid sequence shown in SEQ ID NO: 2 or a homologue or 
fragment thereof, or (B) the nucleotide sequence shown in SEQ ID NO: 1 or a homologue 

25 or fragment thereof, or (C) a host cell expressing a ZGGBP1 polypeptide or a homologue 
or fragment thereof. 

The screen according to the invention may be operated using conventional 
procedures, for example by bringing the test compound or compounds to be screened and 
an appropriate substrate into contact with the protein or a cell capable of producing it and 

30 determining affinity for the protein in accordance with conventional procedures. 

Any compound identified in this way may be used in the treatment of humans 
and/or other animals of one or more of the above mentioned diseases. The invention thus 
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extends to a compound selected through its ability to regulate the activity of the protein in 
vivo as primarily determined in a screening assay utilising the protein containing an amino 
acid sequence shown in SEQ ID NO: 2 or a homologue or fragment thereof, or a gene 
coding therefor for use in the treatment of a disease in which the over- or undcr-activity or 
5 unregulated activity of the protein is implicated. 

In a further aspect of the invention we provide examples of insertions/deletions and 
single base change polymorphisms (mutations) as outlined in Figure 6, 7, 8, 9 and 10. 

The ZGGBP1 gene of the invention may also be used as the basis for diagnosis, for 
example to determine expression levels in a human subject, by for example direct DNA 
10 sequence comparison or DNA/RNA hybridisation assays. Diagnostic assays may involve 
the use of nucleic acid amplification technology such as the PGR and in particular the 
Amplification Refractory Mutation System (ARMS) as claimed in our European Patent 
No. 0 332 435. Such assays may be used to determine allelic variants of the gene, for 
example insertions, deletions and/or mutations such as one or more point mutations. Such 
15 variants may be heterozygous or homozygous. 

In a further aspect of the invention, amplification primers may be provided for use 
in the above diagnostic methods. In general, these are provided as a set and used for PCR 
amplification. One of the primers conveniently hybridises to a ZGGBP1 locus outside the 
region defined by base pairs 516-3568 thus allowing the ZGGBP1 gene on 18q21 to be 
20 identified to the exclusion of other loci. 

The ZGGBP1 gene may also be used in gene therapy, for example where it is 
desired to modify the production of the protein in vivo, and the invention extends to such 
uses. 

Knowledge of the gene according to the invention also provides the ability to 
25 regulate its expression in vivo by for example the use of antisense DNA or RNA. Thus, 

according to a further aspect of the invention we provide an antisense DNA or an antisense 
RNA which is complementary to the polynucleotide sequence shown in SEQ ID NO: 1 . 
By complementary we mean that the two molecules can base pair to form a double 
stranded molecule. 

?° The antisense DNA or RNA for co-operation with the gene in SEQ ID NO: 1 can 

be produced using conventional means, by standard molecular biology and/or bv chemical 
synthesis as described above. If desired, the antisense DNA or antisense RNA mav be 
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chcmically modified so as to prevent degradation in vivo or to facilitate passage through a 
cell membrane and/or a substance capable of inactivating mRNA. for example ribozyme, 
may be linked thereto and the invention extends to such constructs. 

The antisense DNA or antisense RN A may be of use in the treatment of diseases or 
5 disorders in humans in which the over- or under-regulated production of the gene product 
has been implicated. Such diseases or disorders may include those described under the 
general headings of neurologic, eg. stroke, dementia, renal eg. hypertension, nephrosis, 
cardiovascular disorders. 

Convenient DNA sequences may be obtained using conventional molecular 
10 biology procedures, for example by probing a human genomic or cDNA library with one 
or more labelled oligonucleotide probes containing 10 or more contiguous nucleotides 
designed using the nucleotide sequences described here. Alternatively, pairs of 
oligonucleotides one of which is homologous to the sense strand and one to the antisense 
strand, designed using the nucleotide sequences described here to flank a specific region of 
15 DNA may be used to amplify that DNA from a cDNA library. 

The ZGGBP1 protein of the invention and homologues or fragments thereof may 
be used to generate substances which selectively bind to it and in so doing regulate the 
activity of the protein. Such substances include, for example, antibodies, and the 
invention extends in particular to an antibody which is capable of recognising one or more 
20 epitopes containing the protein binding domains shown in Figure 1. In particular the 
antibody may be neutralising antibody. 

As used herein the term antibody is to be understood to mean a whole antibody or a 
fragment thereof, for example a F(ab)2, Fab, FV,. VH or VK fragment, a single chain 
antibody, a multimeric monospecific antibody or fragment thereof, or a bi- or multi- 
25 specific antibody or fragment thereof. 

The invention will now be illustrated but not limited by reference to the following 
detailed description. References, Examples and Figures wherein: 



30 



Figure 1 shows the predicted amino acid sequence of ZGGBP1. The C2 domain is 
indicated by carets , the four WW domains are indicated by asterisks and the Hect domain 
is indicated by underlining . 
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Figure 2 shows a comparison of amino acid sequences of human ned4 Swissprot entry 
P46934 and ZGGBP1. 

Figure 3 shows a Northern blot analysis of various human tissues probed with ZGGBP1 . 
Figure 4 shows a comparison of the nucleic acid sequences of human and mouse 
5 ZZGBP1. The mouse sequence is a partial cDNA which spans the C-tcrminal portion of 
the human protein coding region. 

Figure 5 shows a comparison of the nucleic acid sequences lor ZGGBP1 and Pub3 
Figure 6 shows a polymorphism located at position 3554 of the cDNA sequence 
Figure 7 shows a polymorphism located at position 4828 of the cDNA sequence 
10 Figure 8 shows a polymorphism located in an intronic sequence derived from a BAC 
containing ZGGBP1 

Figure 9 shows a variable number of tetranucleotidc repeats located within an intronic 
sequence from ZGGBP1 

Figure 10 shows an insertion at position 4032 of the cDNA sequence 

15 
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Example 1 

Identification of ZGGBP1 

We used two methods for investigating the 18q2 1 region of interest. In one 
method we used positional cloning to identify novel transcripts from physical clones 
representing the region and in a second method we utilised public databases to identify 
transcripts which had been assigned to a low resolution map of the region by radiation 
hybrid mapping and assigned them to physical clones representing a high resolution map 
of the region. 



20 Method 1 - Positional Cloning 

The 18q21 region described by Stinc et al. (1995 ) is delimited by the STS markers 
used by that group to identify linkage. They found the most strongly linked marker to be 
D18S4 1 , which had a LOD score of 3.5 1 in cases of paternal inheritance. Linkage 
declined over flanking markers. We identified a set of four Yeast Artificial Chromosomes 

25 (YACs) which comprised a contiguous overlapping set of genomic clones covering the 
defined region by the presence in those YACs of STS markers used in the Stine study. 

DNA from the YACs was prepared and used in a PCR-based hybridisation 
approach to enrich for transcripts from a human fetal brain cDNA library. This approach, 
known as direct selection (Lovett et al. 199 1 ) has been shown to be efficient in identifying 

30 transcripts present on large genomic clones. 
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Method 2 - Refining Radiation Hybrid Mapped Transcripts 

The UNIGENE database is a repository for transcripts which have been mapped by 
taking representative Expressed Sequence Tagged Sites (ESTs) and performing PCR 
analysis on a pane! of radiation hybrids which have been calibrated with respect to a 
5 framework of 1000 genetic markers (Schuler et al. 1996). We found 36 EST clusters 
which had been mapped to a radiation hybrid map interval which corresponded to the 
18q21 region of interest and to flanking regions outside. 

All the ESTs were tested by PCR on our YAC genomic clones to determine which 
were present. We found approximately half of the ESTs to be present within the genomic 
10 clones and were able to order them based on their position within the YAC contig. 



Several clones from our direct selection experiments showed sequence homology 
to a known EST which we had previously shown to be present in two of the Y ACs within 

15 the contig. The EST was representative of a cluster of sequences. All of these sequences 
were assembled together using DNAStar Seqman and the consensus sequences obtained 
were used iteratively to search for other database members within both Unigene. dbEST 
and EMBL databases. This resulted in the surprising identification of two further clusters 
of ESTs which had previously not been related to each other on the basis of sequence 

20 analysis. The two new EST clusters were annotated as having sequence similarity to ncd- 
4. This was an unexpected finding since we had recently mapped the human ned-4 by 
Fluorescence In Situ Hybridisation (FISH) to chromosome 15. We were aware that ned-4 
was involved in neuronal cell signalling and we concluded that the EST cluster on 18q21 
must represent a closely related gene and therefore likely to be involved in affective 

25 neurological disorders such as bipolar affective disorder. 

The assembly of the EST clusters did not give rise to a single complete contiguous 
sequence. The reason for this is that many of the EST sequences were derived from 
IMAGE cDNA clones for which end sequence only was available. In order to fill in the 
gaps and give a complete contig, four of these clones (IMAGE LD. 8095 1 , 33059, 79526 

30 and 79984} were sequenced completely to fill the gaps and give an entire complete 

contiguous sequence. Comparison of the sequence with ned-4 showed that the contig 
comprised 2kb of 3' Untranslated Region (UTR) and 70()bp of the coding region of a gene 



Results 
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which had approximately 85% identity at the amino acid level to ncd-4 and which we 
named ZGGBP1. 

Isolation of the full length gene for ZGGBP1 

5 The extending of partial transcripts to full length clones can be a complex and 

difficult process requiring skill and expertise for success. Having considered several 
possibilities, we opted for a PCR-based approach to isolate and characterise the full length 
ZGGBP1 gene. Human foetal brain double stranded cDNA was synthesised from mRNA 
using standard methods (Sambrook et al. 1989) and ligated into lambda Zap vector by use 

10 of adapters. However, in order to minimise the loss of transcripts often seen following the 
cloning step, the resulting ligation mix was not cloned but was instead used as a template 
for PCR. Oligonucleotide primers specific to ZGGBP1 were used in combination with 
vector specific primers to amplify DNA across the unknown part of the gene. Since the 
distance to be covered was unknown, we performed long PCR using the commercially 

15 available BCL Expand enzyme and long (30mer) oligonucleotide primers. Since we were 
using unamplified material, where our target cDNAs were likely to be present only in 
very small amounts, we utilised a secondary PCR step with nested oligonucleotide 
primers and again using long PCR to yield sufficient PCR products to be visible by gel 
analysis and also to minimise the possibility of non-specific PCR amplification. The PCR 

20 products derived from these experiments were then purified and sequenced directly. 

Where necessary, the DNA sequence obtained was used to design further primers to walk 
along the gene in a 3' - 5' direction. The complete nucleotide sequence derived from 
this work is 5.2kb and the translated amino acid sequence is shown in SEQ ID NO: 1. 

The amino acid sequence derived from the cDNA was compared with that of ned-4 

25 and is shown in Figure 2. The proteins diverge markedly towards the N-tcrminal portion 
of the protein, although there is conservation of the common functional motifs. 

Northern analysis using a probe derived from the 3'UTR of ZGGBP1 showed a 
band at approximately 4.8kb but also a more abundant band of 9kb in size in several 
neurological tissues, with the exception of medulla or spinal cord. These bands are likely 

30 to be due to alternative splicing (Figure 3). Other tissues contained the 4.8kb band at 
higher abundance with respect to the 9kb band and also a 4kb band. ZGGBP1 was 
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expressed in all tissues examined with the exception of liver where we could not detect a 
transcript at our current detection sensitivity. 



Comparison of Amino Acid Sequences of human ned-4 and ZGGBP1 

5 A comparison of the amino acid sequences of human ned-4 and ZGGBP1 is 

shown in Figure 6. The two proteins have a high level of homology over much of the C- 
terminal region, including the Hect and WW domains, but diverge over the central portion 
of the protein. There is a further block of homology near to the N-terminal region, 
including the C2 domain. The presence of these domains in ZGGBP1 suggests some 
10 common functionality with ned-4. 

Identification of polymorphic variants of ZGGBP1 

500bp regions of the ZGGBP1 cDNA were PCR amplified from a variety of 
tissues and lymphoblastoid cell lines. Sequencing was carried out and polymorphisms 
15 identified as outlined in Figures 5 and 6. Some intronic sequence had been identified from 
a genomic clone and sequence analysis of these regions identified a further polymorphic 
variant as outlined in Figure 7. A tetranucleotide repeat (GATT) was also identified in an 
intronic sequence derived from this BAC and this was found to have variable numbers of 
repeats (Figure 8). 

20 

Isolation of Genomic Clone for ZGGBP1 

The Research Genetics human Bacterial Artificial Chromosome (BAC) library 
(Shizua et al. 1992, Kim et al. 1996) was screened by PCR using primers specific to the 
3'UTR of ZGGBP1 and BACs were isolated. These are being used to characterise the 
25 structural gene including the intron/exon structure and the 5' regulatory region. 

Isolation of Mouse homologue for ZGGBP1 

The full length sequence of ZGGBP1 shown in SEQ ID NO: 1 was used to search 
the dbEST database to identify homologous mouse sequences. Three overlapping IMAGE 
30 clones were identified (IMAGE I. D. 479436, 5735 10. 482922) comprising a partial 

transcript. Comparison of the mouse and human nucleotide sequence is shown in Figure 
4. The mouse clones were isolated for use as a probe for in situ hybridisation on sections 
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of mouse brain during development, and as a probe of mouse genomic libraries to isolate 
genomic clones and to produce transgenic mice by gene targeting using homologous 
recombination. 
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CLAIMS 

1. A polynucleotide comprising a nucleic acid sequence which encodes the polypeptide 
of Seq ID No 2, and homologues and fragments thereof. 

5 

2. A polynucleotide as claimed in claim 1 which comprises the cDNA sequence of Seq 
ID No 1 . 

3. Polymorphic variants of the polynucleotide as claimed in claim 2, selected from the 
10 group in which: 

i) T at position 3554 is replaced by C. 
li) C at position 4828 is replaced by G. 

lii) T within an intronic region associated with ZGGBP1 is replaced by C. 
iv) C is inserted at position 4032. 

15 

4. A polynucleotide which comprises an animal homologue of the nucleic acid claimed in 
claims 1-3. 

5. A polynucleotide as claimed in claim 4 which comprises the cDNA sequence of Seq 
20 ID No 3. and homologues and fragments thereof. 

6. A polynucleotide which is capable of specifically hybridising to eight or more 
contiguous nucleotides comprised in Seq ID No 1 or Seq ID No 3 or comprised in the 
complementary strands thereof. 

25 

7. A polynucleotide which comprises a ZGGBP1 gene fragment. 

8. A vector comprising a polynucleotide of claims 1-7. 
30 9. A host cell transformed with a vector of claim 8. 
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10. A polypeptide comprising the amino acid sequence of Seq ID No 2 and homologues 
and fragments thereof. 

11. A polypeptide comprising the amino acid sequence of Seq ID No 4 and homologues 
5 and fragments thereof. 

12. A fusion protein in which a polypeptide of claim 10 or claim 1 1 is fused with 
glutatfnone-S-transferase. 

10 13. A method for producing cells which express a polypeptide of claim 10 or claim 1 1 or a 
fusion protein of claim 12, comprising: 

a) culturing a host cell of claim 9 under conditions suitable for the expression of the 
polypeptide. 

b) recovering the polypeptide from the host cell culture. 

15 

14. A method for identifying a compound capable of modulating the activity of a ZGGBP 1 
protein , which method comprises subjecting one or more test compounds to a screen 
comprising: 

a) a protein as claimed in claims 10-12 or a homologue or fragment thereof. 
20 or 

b) a polynucleotide as claimed in claims 1-7 or a homologue or fragment thereof, 
or 

c) a host-cell expressing a polypeptide of a ZGGBP 1 molecule, 
and measuring an effect of the test compound on ZGGBP 1 activity. 

25 

15. A compound that modulates the activity of a human ZGGBP 1 identified by the method 
of claim 14. 

16. A pharmaceutical composition comprising a compound that modulates the activity of a 
30 protein identified by the method of claim 14. 
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17. A diagnostic assay for the detection of ZGGBP1, which assay comprises measuring 
the presence or absence of a protein as claimed in claims 10-12 or a polynucleotide as claimed 
in claims 1-7. 



18. An antisense molecule comprising a complement of the polynucleotide in claims 1-7 
or a biologically effective fragment thereof. 



19. Use of a polynucleotide as claimed in claims 1-7 or claim 18 in gene therapy. 

10 20. An antibody specific for a protein of claims 10-12 or fragments thereof. 

21. A set of amplification primers for selective amplification of a ZGGBP1 gene 
sequence. 
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FIGURE 1 
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FIGURE 3 
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FIGURE A 
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Wild Type (human foetal brain) T/T 

Variant Type (human adult brain) T/C 

Polymorphism Position 3554 
RFLP 



FIGURE 6 
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Wild Type (GM1416) C/C 
Variant (7225) C/G 
Position 4828 



FIGURE 7 
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Primer sequences derived from BAC and used on lymphoblastoid cell lines from 
BPAD Patients. 

Homozygous wild type (KK169) - T/T 
Homozygous variant (KK232) - C/C 



FIGURE 8 
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tgctgcaagtgacaggttccaagaagcccgagggctcagagctgaatgatgaagcg: 
agtccccaaagtgcctggccacccctccctccctggatcactgctgcctgggcttga 
ttgattgattgattgattgattgattgatt ttgagagagattctcactgtcacccai 
gctggagtacagtggtgcgatctcggctcactgcagcctctgcctcccgggttcaag 
caattctcctgcctcagcctcccaagtagctgggactacaggcacgcgccaccaca: 
ccagctaattttgtatttttagtaaaagacggggtttcaccatgttgggccaggatg 
gtcttgatctcctgacctcatgatccacccgccccggcttccaaagtgctgggatac 
aggcatgaacccgacgcgcccagcatggacatttttttttaatcccctgcccttttc 
ttgnggcataattcattgcaggtctcttctatacagatcatggaaaacacattttct 
taactgagttnttattatttatacccagncacctcatgacanntttaccctgttaca 
nacaaaatgggcacctgccaat^ancaactttnatataaggatgctccaggcct 



Tet ranucleot ide repeat underlined 
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Top electropherogram (human foetal brain) - wild type 



Lower electropherogram (7225) - heterozygous variant 

Arrow indicates the position of the C+C insertion - position 4032 
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SEQUENCE LISTING 



( 1 ) GENERAL INFORMATION: 

(i) APPLICANT: 

( A) NAME: Zencca Limited 

(B) STREET: 15 Stanhope Gate 

(C) CITY: London 
ID) STATE: England 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): W1Y 6LN 

(G) TELEPHONE: 0171 304 5000 

(H) TELEFAX: 0171 304 5151 

(I) TELEX: 0171 304 2042 

(ii) TITLE OF INVENTION: NOVEL COMPOUNDS 

(iii) NUMBER OF SEQUENCES: 5 

(lv) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Floppy disk 
( B) COMPUTER: IBM PC compatible 
(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
(D ) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9716162.4 

(B) FILING DATE: 01 -AUG- 1997 



(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 
tA) LENGTH: 5154 base pairs 
( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 

CAAGCGCGCA ATTAACCCTC ACTAAAGGGA ACACCAACAC GTCGCCAGGA 
CTGCGCCGTT 60 
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CGCTGCGCTC ATAGGCGGCG ATTTCATCAA GGGTGGCAAG GATCGCCTGG 
TCGACGGTCA 120 

GGTCGTCCTC GACGCGGTTG CCCTCCTCGT CCTGTTCCAG GGTGAGTGGG 
CGATACCAGG 180 

TGTCCACCGG GAAGGTACGG CCCGACACCT CGACAATCGG CGCATCGTCG 
AAGTGCTTGG 240 

AAAAGCGCTC CAGGTCGATG GTGGCCGAGG TGATGATGAC TTTCAGGTCG 
GGGCGACGCG 300 

GCAACAGGGT CTTGAGGTAG CCGAGCAGGA AGTCGATGTT CAGGCTGCGT 
TCGTGGGCTT 360 

CGTCGACGAC AGGCTCGCGT TATGGCTCCG CTTTCTGCGG CTCTCCTACC 
CTGGCATGGT 420 

GTGTGTGTGT GCCTGTGTGC TACGGAGAGT CCCGTATTCT CAGAGTAAAA 
GTTGTTCTGG 480 

AATGATCTCG CCAAAAAGGA CATCTTTGGA GCCAGTGATC CGTATGTGAA 
ACTTTC A TTG 540 

TACGTAGCGG ATGAGAATAG AG A ACTTGCT TTGGTCCAGA C AAA A AC A AT 
TAAAAAGACA 600 

CTGAACCCAA AATGGAATGA AGAATTTTAT TTCAGGGTAA ACCCATCTAA 
TCACAGACTC 660 

CTATTTGAAG TATTTGACGA AAATAGACTG ACACGAGACG ACTTCCTGGG 
CCAGGTGGAC 720 

GTGCCCCTTA GTCACCTTCC G AC AG A AG AT CCAACCATGG AGCGACCCTA 
TACATTTAAG 780 

GACTTTCTCC TCAGACCAAG AAGTCATAAG TCTCGAGTTA AGGGATTTTT 
GCGATTGAAA 840 

ATGGCCTATA TGCCAAAAAA TGGAGGTCAA GATGAAGAAA ACAGTGACCA 
GAGGGATGAC 900 

ATGGAGCATG GATGGGAAGT TGTTGACTCA AATGACTCGG CTTCTCAGCA 
CCAAGAGGAA 960 

CTTCCTCCTC CTCCTCTGCC TCCCGGGTGG GAAGAAAAAG TGGACAATTT 
AGGCCGAACT 1020 
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TACTATGTCA ACCACAACAA CCGGACCACT CAGTGGCACA GACCAAGCCT 
GATGGACGTG 1080 

TCCTCGGAGT CGGACAATAA CATC AG AC AG ATCAACCAGG AGGCAGCACA 
CCGGCGCTTC 1 140 

CGCTCCCGCA GGCACATCAG CGAAGACTTG GAGCCCGAGC CCTCGGAGGG 
CGGGGATGTC 1200 

CCCGAGCCTT GGGAGACCAT TTCAGAGGAA GTGAATATCG CTGGAGACTC 
TCTCGGTGTG 1260 

GTTTTG CCCC CACCACCGGC CTCCCCAGGA TCTCGGACCA GCCCTCAGGA 
GCTGTCAGAG 1 320 

GAACTAAGCA GAAGGCTTCA GATCACTCCA GACTCCAATG GGGAACAGTT 
CAGCTCTTTG 1380 

ATTCAAAGAG AACCCTCCTC AAGGTTGAGG TCATGCAGTG TCACCGACGC 
AGTTGCAGAA 1440 

CAGGGCCATC TACCACCGCC ATCAGTGGCC TATGTACATA CCACGCCGGG 
TCTGCCTTCA 1500 

GGCTGGGAAG AAAGAAAAGA TGCTAAGGGG CGCACATACT ATGTCAATCA 
TAACAATCGA 1 560 

ACCACAACTT GGACTCGACC TATCATGCAG CTTGCAGAAG ATGGTGCGTC 
CGGATCAGCC 1620 

ACAAACAGTA ACAACCATCT AATCGAGCCT CAGATCCGCC GGCCTCGTAG 
CCTCAGCTCG 1680 

CCAACAGTAA CTTTATTGCC CCGCTGGAGG GTGCCAAGGA CTCACCCGTA 
CGTCGGGCTG 1 740 

TGAAAGACAC CCTTTCCAAC CCACAGTCCC CACAGCCATC ACCTTACAAC 
TCCCCCAAAC 1 800 

C AC A AC AC A A AGTCACACAG AGCTTCTTGC CACCCGGCTG GGAAATGAGG 
ATAGCGCCAA 1860 

ACGGCCGGCC CTTCTTCATT GATCATAACA CAAAGACTAC AACCTGGGAA 
GATCCACGTT 1920 

TG A A ATTTC C AGTACATATG CGGTCAAAGA CATCTTTAAA CCCCAATGAC 
CTTGGCCCCC 1980 
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TTCCTCCTGG CTGGGAAGAA AGAATTCACT TGGATGGCCG AACGTTTTAT 
ATTGATCATA 2040 

ATAGCAAAAT TACTCAGTGG GAAGACCCAA GACTGCAGAA CCCAGCTATT 
ACTGGTCCGG 2100 

CTGTCCCTTA CTCCAGAGAA TTTAAGCAGA AATATGACTA CTTCAGGAAG 
AAATTAAAGA 2160 

AACCTGCTGA TATCCCCAAT AGGTTTGAAA TGAAACTTCA CAGAAATAAC 
ATATTTGAAG 2220 

AGTCCTATCG GAGAATTATG TCCGTGAAAA GACCAGATGT CCTAAAAGCT 
AGACTGTGGA 2280 

TTGAGTTTGA ATCAGAGAAA GGTCTTGACT ATGGGGGTGT GGCCAGAGAA 
TGGTTCTTCT 2340 

TACTGTCCAA AGAGATGTTC AACCCCTACT ACGGCCTCTT TGAGTACTCT 
GCCACGGACA 2400 

ACTACACCCT TCAGATCAAC CCTAATTCAG GCCTCTGTAA TGAGGATCAT 
TTGTCCTACT 2460 

TCACTTTTAT TGGAAGAGTT GCTGGTCTGG CCGTATTTCA TGGGAAGCTC 
TTAGATGGTT 2520 

TCTTCATTAG ACCATTTTAC AAGATGATGT TGGGAAAGCA GATAACCCTG 
AATGACATGG 2580 

AATCTGTGGA TAGTGAATAT TACAACTCTT TGAAATGGAT CCTGGAGAAT 
GACCCTACTG 2640 

AGCTGGACCT CATGTTCTGC ATAGACGAAG AAAACTTTGG ACAGACATAT 
CAAGTGGATT 2700 

TGAAGCCCAA TGGGTCAGAA ATAATGGTCA CAAATGAAAA CAAAAGGGAA 
TATATCGACT 2760 

TAGTCATCCA GTGGAGATTT GTGAACAGGG TCCAGAAGCA GATGAACGCC 
TTCTTGGAGG 2820 

GATTCACAGA ACTACTTCCT ATTG ATTTG A TTAAAATTTT TGATGAAAAT 
GAGCTGGAGT 2880 

TGCTCATGTG CGGCCTCGGT GATGTGGATG TGAATGACTG GAGACAGCAT 
TCTATTTACA 2940 
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AGAACGGCTA CTGCCCAAAC CACCCCGTCA TTCAGTGGTT CTGGAAGGCT 
GTGCTACTCA 3000 

TGGACGCCGA AAAGCGTATC CGGTTACTGC AGTTTGTCAC AGGGACATCG 
CGAGTACCTA 3060 

TGAATGGATT TGCCGAACTT TATGGTTCCA ATGGTCCTCA GCTGTTTACA 
ATAGAGCAAT 3120 

GGGGCAGTCC TGAGAAACTG CCCAGAGCTC ACACATGCTT TAATCGCCTT 
GACTTACCTC 3 1 80 

CATATGAAAC CTTTGAAGAT TTACGAGAGA AACTTCTCAT GGCCGTGGAA 
AATGCTCAAG 3240 

GATTTGAAGG GGTGGATTAA GCACCCTGTG CCTCGGGGGT GGTTGTTCTT 
CAAGCAAGTT 3300 

CTGCTTGCAC TTTTGCATTT GCCTAACAGA CTTTTGCAGA GGCGATGGCA 
GAGAGCAGCT 3360 

GCAGGCATGG TCCCTGGAGC CGAGCCTTCA CCACGCACTC GTCCAAGTTC 
GGGATGCGGG 3420 

AACCTGGTCC CAGCTTGAGT TCCTGCCTTT CCCACCACAA ATTATCAACT 
GGTTGATGTG 3480 

TACACTAATT ACATTTCAGG AGGACTTAAT GCTATTTATG TTGTCCTCTG 
CAGGCAAAGC 3540 

CCTTAATAAA TATTTTACAT CCTTTCTAAT GACAATGAAT GGAATTAATC 
ACTCAACAGG 3600 

TATAGTATTA CGACTCATGT TTACTTTTTA AAATGATTTA GACCGATTTT 
CAGATTTTAT 3660 

TTCGTTATGA TTAAAGATGT CTCATGTACT TGGAAAAGTG AGCATTTTTT 
TTTTTTTTTG 3720 

TATTTCACTT TCATACCAGG CTT A ATGTC A ATGACATTTT TATTTTTGAA 
GTACTCTGAC 3780 

ACCTCCACCC TCTACTTTAT TAGAATTGGA AGGCAAATTT TTGTCCAAAA 
ACCTACAGAC 3840 

AAGTACTTTG A G A G A ATTTC CAATATAATA TTAGACATAA TGATAATTTT 
TTCCATACTC 3900 
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AGAATGAAAA ACTGGATATT ACGTTTTTGT TTTGGGGTTT TTTTGTACAA 
ATTTAGCTAA 3960 

TAGCTACAGG CTGAGAGAAT TGTAACATAG CATGACAAAT TTTGTGTTG A 
CTTGAAAGGA 4020 

ATCACACCAT TATTCCTTAG AAGTAATTAC ATGTGTTCTA AC AC ATTTG A 
GACAGGGTTG 4080 

GACTCCCATT TCTCATCCGA GAAATTACTT AACCCTTCCT GGGCGCTGTA 
CAGTCATCTT 4140 

TTATTCTATT TCCTCTTTGC TGTTTGTAGT AG AG AC ATTT TGAATGAAAC 
TTGGCACTGC 4200 

TTGATTCAAA ACTGTGGAAA CCAGATCTGT TTAGTCTCCT GTTTGTATGC 
GTTTGCTAAT 4260 

GGTAGCTAAA TAACCAGTTT TTGTTGTAAA TGCACCAATT CTGAAGGCAC 
TTTATGTACT 4320 

ACATGGAGGT CATATCTGGT TTTGTTTTTA TTTTTTTATC A TG A AC ATT A 
AATGTGATGA 4380 

TGATTTCTTT TCCCTGCACA CATCTTTCCG GTGCAATATC TATCAATTGT 
GAATCTGGCT 4440 

GCTGGTGTAT AAAAACCTGG ATGTAAAGCT GAGCCTACAG ACCTGTCCTC 
ACCAACTGTT 4500 

TTGTGATTTC TACTCAACTA CAAAGATTTA TTTAATGTAC TCTTAATCTA 
ACTG AGTTTT 4560 

GTTACCAATG ACCTGTTGCA TGCTTCAATA CCGTGTACTG CCTGAGTTGT 
GCCTCTTGTG 4620 

TGCT A G ATT A AAAGTGAGAC AGAGACTTGA CTTG ATCCTC TGAGCCTCAA 
GCTATTGAGC 4680 

TGGTAGTGGC AGAGGACTGA GGGTACCTGC ACAGTTTGAT TCTTTTCCCA 
CGTTGTAAGT 4740 

CTCCATTGCA GAATTGTCGT GCGTTTGAGA AAACACCTGA GGCAGTGTGG 
GAGTTGAACG 4800 

ACCCTGCTGT CCTTTTTAAC CTGTGTTGTC CTAGACCTGT CGGGGCAGTC 
AGGGGACACT 4860 
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AGAGATTTGA TCTCATGCGA GTCATCAATA GGACAAAAAA GTTGTGGTTT 
GGGGAGGTCT 4920 

GTTTGTTACA TAAAAAGGAC CTTTCGGTGT AAGAAATTGC CGTTTTTACC 
CTGCCCTGGC 4980 

TGGCATGTGA GAAGCCATGG AAGGTTGTGG TTGT A A ATG A GTTGTCT A A A 
GGGGTGCAGA 5040 

GGCCTGAGGT TTCTAAAAGA AGGTAGATTT CTACAGAGCT GAGTGTTGGT 
TCCTTTTTCT 5100 

TATTGGTTGA AAATTACCTG GTAGTGATCA GAAAACTTAG ATGCTATGTA ACTC 
5154 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 975 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Phe Arg Leu Arg Ser Trp Ala Ser Ser Thr Thr Gly Scr Arg Tvr 
1 5 10 15 

Gly Ser Ala Phe Cys Gly Ser Pro Thr Leu Ala Trp Cys Val Cys Val 
20 25 30 

Pro Val Cys Tyr Gly Glu Ser Arg He Leu Arg Val Lys Val Val Ser 
35 40 45 

Gly He Asp Leu Ala Lys Lys Asp He Phe Gly Ala Ser Asp Pro Tvr 
50 55 60 

Val Lys Leu Ser Leu Tyr Val Ala Asp Glu Asn Arg Glu Leu Ala Leu 
65 70 75 80 



Val Gin Thr Lys Thr He Lys Lys Thr Leu Asn Pro Lvs Trp Asn Glu 
85 90 95 
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Glu Phe Tyr Phe Arg Val Asn Pro Ser Asn His Arg Leu Leu Phe Glu 
100 105 110 

Val Phe Asp Glu Asn Arg Leu Thr Arg Asp Asp Phe Leu Gly Gin Val 
115 120 125 " 

Asp Val Pro Leu Ser His Leu Pro Thr Glu Asp Pro Thr Met Glu Arg 
130 135 140 

Pro Tyr Thr Phe Lys Asp Phe Leu Leu Arg Pro Arg Scr His Lys Ser 
145 150 155 160 

Arg Val Lys Gly Phe Leu Arg Leu Lys Met Ala Tyr Met Pro Lys Asn 
165 170 175 

Gly Gly Gin Asp Glu Glu Asn Ser Asp Gin Arg Asp Asp Met Glu His 
180 185 190 

Gly Tip Glu Val Val Asp Ser Asn Asp Ser Ala Ser Gin His Gin Glu 
195 200 205 

Glu Leu Pro Pro Pro Pro Leu Pro Pro Gly Tip Glu Glu Lys Val Asp 
210 215 220 

Asn Leu Gly Arg Thr Tyr Tyr Val Asn His Asn Asn Arg Thr Thr Gin 
225 230 235 240 

Trp His Arg Pro Ser Leu Met Asp Val Scr Ser Glu Ser Asp Asn Asn 
245 250 255 

He Arg Gin He Asn Gin Glu Ala Ala His Arg Arg Phe Arg Ser Arg 
260 265 270 

Arg His He Ser Glu Asp Leu Glu Pro Glu Pro Ser Glu Gly Gly Asp 
275 280 285 

Val Pro Glu Pro Tip Glu Thr He Scr Glu Glu Val Asn lie Ala Gly 
290 295 300 

Asp Scr Leu Gly Val Val Leu Pro Pro Pro Pro Ala Ser Pro Gly Scr 
305 310 315 320 

Arg Thr Ser Pro Gin Glu Leu Scr Glu Glu Leu Ser Arg Arg Leu Gin 
325 330 335 



lie Thr Pro Asp Ser Asn Gly Glu Gin Phe Scr Ser Leu lie Gin Arg 
340 345 ' 350 
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Glu Pro Ser Ser Arg Leu Arg Ser Cys Ser Val Thr Asp Ala Val Ala 
355 360 365 

Glu Gin Gly His Leu Pro Pro Pro Ser Val Ala Tvr Val His Thr Thr 

370 375 380 

Pro Gly Leu Pro Ser Gly Tip Glu Glu Arg Lys Asp Ala Lys Gly Arg 
385 390 395 400 

Thr Tyr Tyr Val Asn His Asn Asn Arg Thr Thr Thr Trp Thr Arg Pro 
405 410 415 

lie Met Gin Leu Ala Glu Asp Gly Ala Ser Gly Ser Ala Thr Asn Scr 
420 425 * 430 

Asn Asn His Leu He Glu Pro Gin He Arg Arg Pro Arg Ser Leu Ser 
435 440 445 

Scr Pro Thr Val Thr Leu Xaa Ala Pro Leu Glu Gly Ala Lys Asp Ser 
450 455 460 

Pro Val Arg Arg Ala Val Lys Asp Thr Leu Ser Asn Pro Gin Ser Pro 
465 470 475 480 

Gin Pro Ser Pro Tyr Asn Ser Pro Lys Pro Gin His Lys Val Thr Gin 
485 490 495 

Ser Phe Leu Pro Pro Gly Trp Glu Met Arg He Ala Pro Asn Gly Arg 
500 505 510 

Pro Phe Phe He Asp His Asn Thr Lvs Thr Thr Thr Trp Glu Asp Pro 
515 520 525 

Arg Leu Lys Phe Pro Val His Met Arg Ser Lvs Thr Ser Leu Asn Pro 
530 535 540 

Asn Asp Leu Gly Pro Leu Pro Pro Gly Trp Glu Glu Arg He His Leu 
545 550 555 " 560 

Asp Gly Arg Thr Phe Tyr lie Asp His Asn Scr Lys lie Thr Gin Trp 
565 570 575 

Glu Asp Pro Arg Leu Gin Asn Pro Ala He Thr Gly Pro Ala Val Pro 
580 585 590 



Tyr Ser Arg Glu Phe Lys Gin Lys Tyr Asp Tvr Phe .Arg Lvs Lvs Leu 
595 600 605 




WO 99/06539 PCT/GB98/02259 

-10- 



Lys Lys Pro Ala Asp He Pro Asn Arg Phc Glu Met Lys Leu His Arg 
610 615 620 

Asn Asn He Phe Glu Glu Ser Tyr Arg Arg He Met Ser Val Lys Arg 
625 630 635 " 640 

Pro Asp Val Leu Lys Ala Arg Leu Trp He Glu Phe Glu Ser Glu Lys 
645 650 655 

Gly Leu Asp Tyr Gly Gly Val Ala Arg Glu Trp Phe Phe Leu Leu Ser 
660 665 670 

Lys Glu Met Phe Asn Pro Tyr Tyr Gly Leu Phe Glu Tyr Ser Ala Thr 
675 680 685 

Asp Asn Tyr Thr Leu Gin He Asn Pro Asn Ser Gly Leu Cys Asn Glu 
690 695 700 

Asp His Leu Ser Tyr Phe Thr Phe He Gly Arg Val Ala Gly Leu Ala 
705 710 715 ' 720 

Val Phe His Gly Lys Leu Leu Asp Gly Phe Phe He Arg Pro Phe Tyr 

725 730 * 735 

Lys Met Met Leu Gly Lys Gin He Thr Leu Asn Asp Met Glu Ser Val 
740 745 750 

Asp Ser Glu Tyr Tyr Asn Ser Leu Lys Trp He Leu Glu Asn Asp Pro 
755 760 765 

Thr Glu Leu Asp Leu Met Phe Cys He Asp Glu Glu Asn Phc Gly Gin 
770 775 780 

Thr Tyr Gin Val Asp Leu Lys Pro Asn Gly Ser Glu He Met Val Thr 
785 790 795 ' 800 

Asn Glu Asn Lys Arg Glu Tyr He Asp Leu Val He Gin Tip Arg Phe 
805 ' 810* 815 

Val Asn Arg Val Gin Lys Gin Met Asn Ala Phe Leu Glu Glv Phe Thr 
820 825 830 

Glu Leu Leu Pro He Asp Leu lie Lys He Phe Asp Glu Asn Glu Leu 
835 840 845 



Glu Leu Leu Met Cys Gly Leu Gly Asp Val Asp Val Asn Asp Trp Arg 
850 855 860 
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Gln His Ser lie Tyr Lys Asn Gly Tyr Cys Pro Asn His Pro Val lie 
865 870 875 ' ' 880 

Gin Tip Phe Tip Lys Ala Val Leu Leu Met Asp Ala Glu Lvs Arg He 
885 890 895 

Arg Leu Leu Gin Phe Val Thr Gly Thr Ser Arg Val Pro Met Asn Gly 
900 905 910 

Phe Ala Glu Leu Tyr Gly Ser Asn Gly Pro Gin Leu Phe Thr He Glu 
915 920 925 

Gin Tip Gly Ser Pro Glu Lys Leu Pro Arg Ala His Thr Cys Phe Asn 
930 935 940 

Arg Leu Asp Leu Pro Pro Tyr Glu Thr Phe Glu Asp Leu Arg Glu Lvs 
945 950 955 960 

Leu Leu Met Ala Val Glu Asn Ala Gin Gly Phe Glu Gly Val Asp 
965 970 975 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ACAATGGGGG CGTGGCAGAG AATGGTTCTT CTTACTGTCC AAAGAGATGT 
TTAACCCCTA 60 



CTATGGCCTC TTCGAGTACT CTGCCACGGA CAACTACACA CTTCAGATCA 
ATCCCAACTC 1 20 



AGGCCTCTGT AATGAAGACC ATTTGTCCTA TTTCACCTTC ATTGGAAGAG 
TTGCTGGCCT 1 80 



AGCGGTGTTT CATGGGAAAC TCTTAGATGG ATTCTTCATT CGACCATTCT 
ACAAGATGAT 240 
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GCTGGGGAAG CAGATAACGC TGAACGACAT GGAGTCCGTG GACAGCGAGT 
ACTACAACTC 300 

TTTGAAGTGG ATCTTAGAAA ACGACCCCAC GGAACTTGAC CTCATGTTCT 
GCATAGACGA 360 

GAGAACTTTG GGCAGACATA CCAAGTGGAT CTGAAGCCCA ACGGGTCAGA 
AATAATGGTA 420 

ACCAATGAGA ACAAACGAGA ATACATTGAC TTAGTCATCC AGTGGAGATT 
TGTGAACAGG 480 

GTCCAGAAGC AAATGAATGC CTTCTTGGAG GGATTTACAG AACTTCTTCC 
AATCGACTTG 540 

ATT A A A ATTT TTGATGAAAA TGAGCTGGAG TTGCTGATGT GCGGCCTTGG 
TGATGTCGAC 600 

GTGAACGACT GGAGACAGCA CTCTATTTAC AAGAACGGCT ACTGCCCCAA 
CCACCCTGTC 660 

ATCCAGTGGT TCTGGAAGGC CGTGCTCCTG ATGGATGCTG AGAAGCGCAT 
CCGGTTACTA 720 

CAGTTTGTCA CAGGCACCTC CAGAGTACCC ATG A ATGG AT TTGCCGAACT 
CTATGGTTCC 780 

AATGGTCCTC AGCTGTTTAC AATAGAGCAA TGGGGCAGTC CG AAA A ACT A 
CCAGAGCTCT 840 

ACATGCTTAA TCGC 854 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(it) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



WO 99/06539 



PCT/GB98/02259 



-13- 

His Ala Cys Ser Asn Ala Ala Ser Arg Ala Ala Ala Arg Val Ala Ala 
1 5 10 15 

Arg Cys Thr Ala Arg Ser Arg Ser Gly Arg Arg Ser Ser Ser Val Ser 
20 25 30 

Arg Ser Ser Ser Arg Gly Ala Ser Ser Ser Met Ser Ser Asp Met Ala 
35 40 45 

Ala Asp Ser Ala Val Ser Asp Val Trp Cys Asp Lys Thr Asp Glv Gly 
50 55 60 

Gly Ser Gly Ser Asp Val Thr Asp Thr Cvs Cys Gly Cys Trp Asn Asn 
65 70 75 80 

Ser His Val Thr Ala Asp Tyr His Asn Asp Asp Thr Arg Val Val Arg 

85 90 95 

Val Lys Val Ala Gly Gly Ala Lys Lys Asp Gly Ala Ser Asp Tvr Val 
100 105 110 

Arg Val Thr Tyr Asp Met Ser Gly Thr Ser Val Thr Lys Thr Lys Lys 
115 120 125 

Ser Asn Lys Trp Asn Arg Val Arg His Arg Val Asp Asn Arg Thr Arg 
130 135 140 

Asp Asp Gly Val Asp Val Tyr Thr Asn Arg Met Arg Tyr Thr Lys Asp 
145 150 155 160 

Val His Arg Ser His Lys Scr Arg Val Lys Gly Tyr Arg Lys Met Thr 
165 170 175 

Tyr Lys Asn Gly Ser Asp Asn Ala Asp Ala Gly Trp Val Val Asp Asp 
180 185 190 

Ala Ala Thr His His Ser Gly Trp Arg Asp Val Gly Arg Thr Tyr Tyr 
195 200 205 

Val Asn His Ser Arg Arg Thr Trp Lys Arg Ser Asp Asp Asp Thr Asp 
210 215 220 

Asp Asn Asp Asp Met Ala Arg Ala Thr Thr Arg Arg Ser Asp Val Asp 
225 230 235 240 " 

Gly Asp Asn Arg Ser Asn Trp Val Arg Asp Asn Thr Tyr Ser Gly Ala 
245 250 255 
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Val Ser Scr Gly His Asp Val Thr His Ala Asn Thr Arg Ala Val Cys 
260 265 270 

Gly Asn Ala Thr Ser Val Thr Ser Scr Asn His Ser Ser Arg Glv Glv 
275 280 285 

Ser Thr Cys Thr Val Thr Ser Ser Gly Gly Trp Lys Asp Asp Arg Gly 
290 295 300 

Arg Ser Tyr Tyr Val Asp His Asn Ser Lys Thr Thr Thr Trp Ser Lys 
305 310 315 320 

Thr Met Asp Asp Arg Ser Lys Ala His Arg Gly Lys Thr Asp Ser Asn 
325 330 335 

Asp Gly Gly Trp Arg Thr His Thr Asp Gly Arg Val Asn His Asn Lys 
340 345 350 

Lys Thr Trp Asp Arg Asn Val Ala Thr Gly Ala Vai Tyr Ser Arg Asp 
355 360 365 

Tyr Lys Arg Lys Tyr Arg Arg Lys Lys Lys Thr Asp Asn Lys Met Lys 
370 375 380 

Arg Arg Ala Asn Asp Ser Tyr Arg Arg Met Gly Val Lys Arg Ala Asp 
385 390 395 400' 

Lys Ala Arg Trp Asp Gly Lys Gly Asp Tyr Gly Gly Val Ala Arg Trp 
405 410 415 

Ser Lys Met Asn Tyr Tyr Gly Tyr Scr Ala Thr Asp Asn Tyr Thr Asn 
420 425 ' 430 

Asn Ser Gly Cys Asn Asp His Ser Tyr Lys Gly Arg Val Ala Gly Met 
435 440 445 

Ala Val Tyr His Gly Lys Asp Gly Arg Tyr Lys Met Met Lys Thr His 
450 455 460 

Asp Met Ser Val Asp Scr Tyr Tyr Scr Ser Arg Trp Asn Asp Thr Asp 
465 470 475 480 

Arg Asp Gly Thr His His Lys Thr Gly Gly Ser Val Val Thr Asn Lys 



485 



490 



495 



Asn Lys Lys Tyr Tyr Val Trp Arg Val Asn Arg Lvs Met Ala Ala Lvs 
500 505 510 
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Gly Asp Lys Asp Asn Met Cys Gly Gly Asp Val Asp Val Asn Asp Trp 
515 520 ' 525 

Arg His Thr Lys Tyr Lys Asn Gly Tyr Ser Met Asn His Val His Trp 
530 535 540 

Trp Lys Ala Val Trp Met Met Asp Ser Lys Arg Arg Val Thr Gly Thr 
545 550 555 560 

Xaa Ser Arg Val Met Asn Gly Ala Tyr Gly Ser Asn Gly Ser Thr Val 
565 570 " 575 

Trp Gly Thr Asp Lys Arg Ala His Thr Cys Asn Arg Asp Tyr Ser Asp 
580 585 590 

Trp Asp Lys Met Ala Asn Thr Gly Asp Gly Val Asp 
595 ' 600 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TGCTGCAAGT GACAGGTTCC AAGAAGCCCG AGGGCTCAGA GCTGAATGAT 
GAAGCGCAGT 60 

CCCCAAAGTG CCTGGCCACC CCTCCCTCCC TGGATCACTG CTGCCTGGGC 
TTGATTGATT 120 

GATTGATTGA TTGATTGATT GATTTTGAGA GAGATTCTCA CTGTCACCCA 
GGCTGGAGTA 180 

CAGTGGTGCG ATCTCGGCTC ACTGCAGCCT CTGCCTCCCG GGTTCAAGCA 
ATTCTCCTGC 240 

CTCAGCCTCC CAAGTAGCTG GGACTACAGG CACGCGCCAC CACACCCAGC 
TAATTTTGTA 300 
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TTTTT AGT A A AAGACGGGGT TTCACCATGT TGGGCCAGGA TGGTCTTGAT 
CTCCTGACCT 360 

CATGATCCAC CCGCCCCGGC TTCCAAAGTG CTGGGATACA GGCATGAACC 
CGACGCGCCC 420 

AGCATGGACA TTTTTTTTTA ATCCCCTGCC CTTTTCTTGG GCATAATTCA 
TTGCAGGTCT 480 

CTTCTATACA GATCATGGAA AACACATTTT CTTAACTGAG TTTTATTATT 
TATACCCAGC 540 

ACCTCATGAC ATTTACCCTG TTACAACAAA ATGGGCACCT GCCAAAACAA 
CTTTATATAA 600 



GGATGCTCCA GGCCT 
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